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Abstract 



The Cherenkov light flashes produced by Extensive Air Showers are very short in 
time. A high bandwidth and fast digitizing readout, therefore, can minimize the 
influence of the background from the light of the night sky, and improve the per- 
formance in Cherenkov telescopes. The time structure of the Cherenkov image can 
further be used in single-dish Cherenkov telescopes as an additional parameter to re- 
duce the background from unwanted hadronic showers. A description of an analysis 
method which makes use of the time information and the subsequent improvement 
on the performance of the MAGIC telescope (especially after the upgrade with an 
ultra fast 2 GSamples/s digitization system in February 2007) will be presented. 
The use of timing information in the analysis of the new MAGIC data reduces the 
background by a factor two, which in turn results in an enhancement of about a 
factor 1.4 of the flux sensitivity to point-like sources, as tested on observations of 
the Crab Nebula. 

Key words: Gamma-ray astronomy, lACT, Cherenkov images, timing analysis 



1 Introduction 



Imaging Atmospheric Cherenkov Telescopes (lACTs) collect the Cherenkov 
light from Extensive Air Showers (EAS) to form an image. The morphology 
of the shower image [1] is used to recognize the few 7-ray initiated show- 
ers among the much more numerous hadronic showers initiated by cosmic 
ray nuclei. This standard approach only exploits the knowledge of the spa- 
tial distribution of the Cherenkov photons in the camera plane, but further 
informations regarding the shower development are in principle available in 
the photon arrival times [2]. The possibility of using effectively the timing 
information to improve the performance of lACTs has been explored in ear- 
lier works. The HEGRA collaboration measured on their data a time gradient 
along the major axis in the Cherenkov images [3j. They suggested that this 
information may be useful to estimate the distance to the shower core and the 
shower direction in the case of a single Cherenkov telescope, but of limited 
use in an array of lACTs, where a stereoscopic view of the shower is available. 
A recent MC study |1] suggests that the use of the time profile of Cherenkov 
images may lead to important background rejection improvements in future 
Cherenkov instruments (even if, according to [5] , pioneering tests on real data 

* Corresponding author. 
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data led only to marginal improvements). A different approach to exploit the 
time information is proposed by the authors of [6], capitalizing on the differ- 
ent characteristic time spread of the images of gamma-initiated air showers as 
compared to hadronic showers or images from distant single muons. 

The MAGIC (Major Atmospheric Gamma Imaging Cherenkov) telescope is 
a single-dish Cherenkov telescope, designed for the detection of VHE gamma 
rays in the ~ 50 GeV to ~ 10 TeV band [7]. Its camera is composed of 577 
pixels equipped with high quantum efficiency photomultiplier tubes (PMTs). 
In the first years of operation of MAGIC, the PMT signals were digitized with 
300 MSamples/s Flash Analogic to Digital Converters (FADCs). In February 
2007 the data acquisition of the MAGIC telescope was upgraded with ultra- 
fast FADCs capable to digitize at 2 GSamples/s [8i9j . The implementation of a 
faster readout might lead to an improvement in the telescope performance for 
two reasons: a reduction in the amount of NSB (Night Sky Background) light 
integrated with the real signal, and an improvement in the reconstruction 
of the timing characteristics of the recorded images. The main aim of this 
work is to establish whether the timing information is useful in the analysis 
of single-dish lACT data. In the following we will present an analysis method 
which makes use of signal timing, and compare its performance to that of the 
standard MAGIC analysis used up to now. 

The timing analysis proposed here is composed of two different parts. The first 
is the use of the time information to enhance the efficiency and to lower the 
threshold of the image cleaning procedure, thanks to the introduction of time 
constraints. The second is the use of additional time-related image parameters 
in the algorithms for the suppression of the isotropic background of hadron- 
initiated showers. Although the possibility of using timing to improve the 
lACT technique was suggested a long time ago, this is, to our knowledge, the 
first time in which it has been successfully applied to real data. 



2 Analysis method 

When an atmospheric shower triggers the MAGIC telescope, the information 
of all camera pixels is stored by the Data AcQuisition (DAQ) system. This 
information consists mainly of the digitized pulse of the PMT corresponding 
to each pixel in time slices of 0.5 nanoseconds. From the digital information 
of the pulse it is possible, through the so-called signal extractor routine, to 
reconstruct the number of photons that arrived at the pixel and their mean 
arrival time. This can be done in several manners. For the current MAGIC 
data (with 2 GS/s sampling), a simple cubic spline is built from the FADC 
readout, and its integral in a range around the highest peak provides a measure 
of the charge recorded by the pixel. The arrival time is defined as the position 
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of the rising edge of the pulse at 50% of the peak value. Before the upgrade of 
the FADC system, the pulse shape and duration was dictated by the artificial 
stretching introduced in the electronic chain to ensure that the pulse spanned 
over several FADC samples (then taken every 3.3 ns). For those older data, the 
digital filter algorithm [10], which makes use of the known pulse shape [1 Ij , 
was used. After calibration, the charge (Q) is converted to photo-electrons 
units (phe). Details about the calibration can be found in [12J. 

2.1 Image Cleaning 

The information from the pixels is first used to perform the image cleaning, 
that aims at identifying which pixels belong to the shower image. In figure [T] 
an example of an event before and after the cleaning is shown. 

In the MAGIC Analysis and Reconstruction Software (MARS [13]), different 
cleaning methods can be chosen by the user. The most commonly used is the 
standard - absolute method. The choice may depend on the sky around the 
source (galactic or extra-galactic) or the prevailing atmospheric conditions. 
This procedure uses a threshold signal value qi (a fixed value in terms of phe) 
to select the core pixels, namely all those with charge above qi and which 
have at least one neighbor fulfilling the same conditioij^. In a second stage, 
all pixels which have at least one core neighbor, and whose charge is above q2 
(with q2 < qi), are included in the image (these are called boundary pixels). 

Relaxing the cleaning levels qi and q2 results in a larger number of pixels per 
image, and accordingly a lower analysis energy threshold, since a minimum 
number of pixels is needed to proceed with the analysis. On the other hand, 
a low cleaning level increases the probability to include in the cleaned image 
a noise pixel (mainly due to NSB or other unwanted light pollution). The 
inclusion of pixels unrelated to the shower degrades the image parameters and 
worsens the performance of the subsequent analysis. 

Together with the signal intensity also an arrival time value is assigned by the 
signal extractor to each pixel. These times can be used to further constrain 
the selection of core and boundary pixels in the image cleaning algorithm: 
Cherenkov flashes are very brief (of the order of few ns), and NSB photons 
produce pulses asynchronous with respect to the pulses of the shower image. 
A timing coincidence window between the mean arrival time and the single 
pixel arrival time can avoid to confuse NSB signals with real image tails. This 
further constraint allows to relax the cleaning levels qi and q2, lowering in this 
way the energy threshold. The time information has already been used for the 

^ This additional requirement avoids the selection of pixels unrelated to the image 
whose large charge results from an afterpulse in the PMT. 
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image cleaning in the analysis of the observations of the Crab Nebula with 
MAGIC [H]. However, the algorithm used in that analysis differs from the 
one we are proposing here and can be found in detail in [15j . 
The procedure used in this work can be summarized in this way: 

• After selecting the core pixels in the same way as in the standard procedure, 
we reject those whose arrival time is not within a time Ati of the mean 
arrival time of all core pixels. 

• In the selection of the boundary pixels we add the constraint that the time 
difference between the boundary pixel candidate and its neighbor core pixels 
is smaller than a second fixed time constraint At2- 

The charge levels of the standard cleaning commonly used in the past in 
the analysis of MAGIC data are qi = 10 phe for the core pixels and q2 = 
5 phe for the boundary pixels. For the time-cleaning approach, the charge 
threshold levels were decreased to 6 and 3 phe respectively. Concerning the 
time constraints, the values Ati = 4.5 ns and At2 = 1.5 ns were selected. 
The choice of these values is supported by a study based on Monte Carlo data 
(see IIS] and [T7] for more details), in which we have assumed "dark night" 
conditions (and hence the used criteria would not be optimal during moon 
light or twilight observations when the number of noise photons is higher). 
The setting of these time constrains resulted also not very critical for choices 
within ~ Ins respect to the values used here. 

In figure [T] an example event is shown. The image footprint is visible in the 
arrival time display (upper right plot) because of the short duration of the 
Cherenkov flash, illustrating the validity of the time image cleaning approach. 
The arrival times of the signal pixels are distributed within few ns. The other 
pixels have, as expected, a random arrival time distribution. In the second 
and third rows of figure [T] the same event is plotted after applying different 
cleaning methods. 



2.2 Timing characteristics of the shower images 

As previously introduced, Cherenkov images present some timing features, the 
most important of which is a dependency between the timing profile along the 
major axis of the image and the Impact Parameter (IP) of the shower. The 
model proposed in |3] explains well this relationship. In case of a small impact 
parameter (IP < 60 m), the light emitted in the higher part of the shower (the 
shower head) will arrive delayed with respect to the light emitted in the lower 
part of the shower (the tail), since the photons emitted first travel slower (at a 
speed c/n) than the ultra relativistic particles of the shower that produce the 
photons at lower altitudes. In case of a larger impact parameter (IP > 120 m), 
the effect just described is reduced or even inverted, as the arrival time from the 



6 




Fig. 1. Illustrative 7-event images (Monte Carlo data, Energy=71 GeV, Impact Pa- 
rameter=lll m). First row: display of raw recorded data (left) and arrival times 
information (right). Second row: comparison of standard cleaning with 10-5 phe 
minimum charge levels (left) and 6-3 minimum charge levels (right). Bottom: image 
obtained with the time image cleaning (6-3 phe minimum charge levels and 4.5 ns 
and 1.5 ns as time constrains). The simulated gamma-ray source is located in the 
center of the camera (yellow star). 

tail becomes the sum of the times spent in the paths of particles and photons, 
respectively. In this latter situation, the photons emitted in the lower part 
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Fig. 2. Relative arrival time distributions of photons, averaged over a large sample of 
Monte Carlo 7-events (Energy=100 GeV) for several values of the impact parameter. 
The black solid line marks the 50% of the maximum photon density whereas the 
dashed line (~10% of the maximum) represents qualitatively the border of the 
image after the cleaning procedure. The time profile of the recorded images changes 
clearly depending on the IP of the primary shower. The source is located in the 
(0,0) position. Plots from [18]. 

of the shower will arrive later than the photons emitted in the upper part. 
Events with an intermediate impact parameter show a flat time profile. These 
features are well visible in the templates of average Monte Carlo gamma-ray 
images on the MAGIC camera (figure [2]), created by the superposition of many 
events at fixed values of energy and impact parameter. These are part of a 
dedicated MC sample produced for a different study [18] on the applicability of 
the so-called "model analysis" ([IS]) to the MAGIC data. In these templates, 
it is possible to recognize the dependency of the timing structure with the IP: 
the arrival time increases from shower head (bottom part of the images) to 
shower tail at large impact parameter, and from tail to head for small impact 
parameters. 



2.3 Definitions of time-parameters 

In order to exploit the timing characteristics of the showers in the analysis 
stage, some time-related image parameters have to be introduced. A linear fit 
of the arrival time versus the distance along the major image axis provides an 
easy way to characterize the time profile of a shower image. Another useful 
quantity may be the overall spread of the arrival times of all pixels surviv- 
ing the cleaning. Based on these considerations, two new time-related image 
parameters have been introduced: 

• Time Gradient: this parameter measures how fast the arrival time changes 
along the major image axis. The pixel coordinates are projected onto this 
axis, reducing the problem to one dimension. Then the arrival time ver- 
sus the space coordinate along the major axis is fitted to a linear function 
t = m ■ X -\- q. The slope m is called in the following Time Gradient of the 
image. The sign of this parameter is positive if the arrival time increases 
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as we move away from the location of the source on the camera, negative 
otherwise. It is therefore a parameter which depends on the position of the 
candidate gamma-ray source. 
• Time RMS: the root mean square of the arrival times of all pixels belong- 
ing to the image after cleaning. It measures the spread of the arrival times 
irrespective of the pixel position in the camera. This parameter has been 
suggested as a possible background discriminator in [6]. It must be noted 
that due to the time structure of the events, this parameter is correlated 
with the Time Gradient. 



TIME GRADIENT vs IP I I DISTvsIpI 




Impact Parameter (m) Impact Parameter (m) 

Fig. 3. Scatter plot of Time Gradient vs IP (left) and scatter plot of Dist vs IP 
(right) for Monte Carlo 7-events. A cut Size > 200 phe has been applied. Note that 
the impact parameter correlates better with Time Gradient for distant showers, 
whereas the correlation is better with Dist for nearby showers. The information 
provided by the two image parameters is hence complementary. 

For the geometrical reasons explained in the previous section, the Time Gra- 
dient is well correlated with the impact parameter, as can be seen in the left 
panel of figure [31 On the other hand, the classical Dist parameter, which is 
the angular distance from the image center of gravity to the source location 
on the camera, is also correlated to the impact parameter for gamma rays 
coming from a point-like source: as we increase the impact parameter, the 
image gets longer and moves away from the source, as we observe it at an 
increasingly larger ang A consequence of this is that Time Gradient is 
correlated with Dist for gamma-ray images from a point-like source (see fig. 
H]), whereas no such correlation exists for hadron images, since hadron showers 
are distributed isotropically, and therefore no strong correlation of Dist and 
the impact parameter is expected for them. Already from this, one can expect 
some improvement in the background discrimination through the use of the 
Time Gradient in the analysis. 



^ Fluctuations in the shower development make that, even for a fixed energy, the 
altitude at which it develops changes from event to event, which blurs the correlation 
of Dist and the impact parameter. 
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Another way of looking at this is the following: the shower direction is not well 
determined by a single lACT. When observing a point-like source, all gamma- 
ray images will be pointing (within ~ 10°) towards the source location on 
the camera, but so will many background cosmic ray showers whose axes are 
coplanar with the line pointing from the mirror dish center toward the source. 
The bare shower shapes allow to eliminate some of those, but the timing profile 
provides additional independent information to recognize the gamma rays (the 
images with "consistent" values of Dist and the Time Gradient) and reject 
the background, and is therefore expected to improve the performance of the 
analysis. 

Note that in stereoscopic lACT systems the determination of the shower direc- 
tion and the impact parameter is obtained by the intersection of the multiple 
shower images, and thus the information that could be provided by the tim- 
ing is redundant^]. Therefore, the results obtained in this study should not be 
extrapolated to the case of stereo observations. 

Regarding the Time RMS, it has been suggested in [B] that it may be of help in 
identifying triggers produced by single, large impact parameter muons (whose 
images may otherwise be gamma- like), as well as other hadron- initiated show- 
ers (since their Time RMS distribution has, with respect to that of gamma 
rays, a longer tail towards large values). 

2.4 Role of the Monte Carlo simulation 

Making sure that the Monte Carlo reproduces the features of the real data is 
very important to perform a good background rejection and energy estimation. 
In the MAGIC analysis, both tasks rely heavily on the MC simulated events. 
The MC is also crucial when the gamma-ray flux of a source is computed, 
since the estimation of the collection area is done using a Monte Carlo "test" 
sample. Therefore, the detector simulation has been updated to reproduce the 
digitization features of the new 2 GS/s digitization system: beyond the higher 
digitization speed, also the level of electronic noise and the overall precision 
of the time determination have been adjusted, taking into account the entire 
electronics chain. The time resolution can be estimated from the calibration 
events (light pulses of ~2 ns duration), looking at the distribution of the arrival 
time difference between any two camera pixels. The RMS of the distribution 
is 550 ps. This correspond to a time resolution for a sing le pixel of 550/^2 = 
390 ps (see [9j for details). Actually, Cherenkov pulses are generally faster 
than the calibration pulses, and hence, for a pulse of comparable amplitude, 
showers signals have a better resolution. 

^ In the special case of a two telescopes stereo system, the impact parameter can 
still be poorly determined for some degenerate events. 
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I DIST VB TIME-GRADIENT (MC) 



I DIST VS TIME-GRADIENT (Real gammas) I 



DIST vs TIME-GRADIENT (OFF data) 



DIST [deg] 

Fig. 4. Time Gradient versus Dist parameter correlation for MC 7-rays (left), real 
7-rays (center) and background (right) data. 

A demonstration of the Time Gradient - Dist correlation described in section 
12.31 can be seen in figure |H The left plot of the figure is made with pure 7-MC 
events while the central panel displays the difference between on-source and 
off-sonrce distributions (from a Crab sample described later), and therefore 
shows the distribution of the gamma ray excess. A correlation Time Gradient - 
Dist is present in both cases. Such correlation is almost completely suppressed 
for hadron images (even after a cut in the Alpha parameter), as shown in the 
plot on the right. 



3 Experimental results 



MAGIC observations are performed mainly in two modes: on-ojf and wobble. 
In the former, the telescope points directly at the source to obtain the on- 
source data whereas the off data, used to estimate the background, are taken 
by pointing at a region of the sky where no signal is expected. The wobble 
mode eliminates the need for taking dedicated ojf runs [20]. The telescope 
is not aimed directly at the source, but slightly off (0.4° away). In this way, 
the source does not occupy a privileged position in the camera, and the back- 
ground can be estimated by re-doing the analysis with respect to points on 
the camera ( "false-sources" ) which are expected to be equivalent to the source 
location (for instance the point symmetric to the source with respect to the 
camera center). The "wobbling" consists in changing the telescope pointing 
every twenty minutes between two symmetric sky directions around the source, 
which is an additional guarantee of the equivalence of source and false-source 
against effects like inhomogeneities in the camera response or the dependence 
of acceptance with the zenith angle. For the studies presented in this paper, 
the background has always been estimated from one single false-source, lo- 
cated opposite to the source w.r.t. the camera center. The main disadvantage 
of the wobble method is a small reduction of the trigger efficiency leading to 
a reduction of ~ 15-20% in the nominal flux sensitivity, since the trigger area 
is limited to ~ 1° around the camera center. 
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The data sample chosen for this study consist of 5.7 h of Crab Nebula obser- 
vations performed in wobble mode during the nights of the 7*'^, 9*'*, IS*'' and 
17*^ of February 2007 (soon after the installation of the new MUX FADCs 
readout) at a zenith angles smaller than 30°. Weather conditions were good 
during all the nights considered. 

3.1 Analysis comparison strategy 

In order to compare the sensitivity with and without the help of the timing 
information, three different analyses of the above mentioned Crab Nebula data 
sample were performed: 

(1) The standard analysis commonly performed on the MAGIC data before 
the upgrade of the DAQ. The image cleaning levels were 10 and 5 phe 
(see section [?!T1) . and no time information was used. The standard im- 
age parameters {Size, Width, Length, Dist, Cone and the third moment 
along the major axis, dubbed M3lon(ji^ were used to perform the 7/h 
separation. This is the reference analysis for the comparison. 

(2) An analysis using 6-3 phe as cleaning levels, with the time constraints 
described in section [?!T1 The same standard parameters of analysis 1 were 
used for 7/h separation. This analysis is meant to evaluate the effect of 
the time cleaning. 

(3) The same 6-3 phe time cleaning of analysis 2 is used. In this analysis, 
in addition to the standard image parameters, the Time RMS and the 
Time Gradient image parameters (see section 12. 3p were used as input for 
the background rejection. This analysis is meant to evaluate the analysis 
improvement due to the timing parameters (used together with the time 
cleaning) . 

In all cases the image parameters were the input to the Random Forest (RF) 
event classification algorithm [21], which was used to perform the 7/h sepa- 
ration task. The training samples for the construction of the RF are a MC 
gamma sample, and a sample of real off data to represent the background. 
When applied to the data, the RF tags each event with a single value called 
Hadronness (ranging from to 1) which is a measure, based on the image 
parameters, of the likelihood that the event is a background event. 

The sum of the signals (in phe) of the two pixels with highest signal {Size- 
2) was used as parameter to select event samples of different energies. Like 

^ This measures the image asymmetry along its major axis. It is a source-dependent 
parameter, since its sign is referred to the source position on the camera. The sign 
is defined such that it is positive when the shower head is closer to the source than 
the shower tail, as is the case for properly reconstructed gamma rays. 
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the classical event Size, Size-2 is correlated with energy, but unlike Size, it 
is very weakly dependent on the cleaning level£ll- If we had chosen the total 
Size to define the samples, we would have faced the problem that they would 
correspond to different energies in the three analyses, therefore making the 
interpretation of the results more difficult. 



Three different bins of Size-2 are considered in this work: the first one {Size- 
2 > 100 phe) corresponds to the energy range where the integral fiux sen- 
sitivity of MAGIC is best (resulting in a peak gamma energy of around 280 
GeV); the second bin (40 phe < Size-2 < 100 phe) is intended to study the 
performance at intermediate energies (peak energy ~ 150 GeV). Finally, the 
performance for gamma rays below 100 GeV, which will be discussed in a 
separate section, has been evaluated in the Size-2 range from 20 to 40 phe. 
The estimated energy distributions for the excess events in each of the three 
Size-2 bins just mentioned (obtained from the real data sample) are shown in 
figure O 



Crab excess events, SIZE-2 > 1 00 phe ^ 



Crab excess events, SIZE-2 = 40-100 phe 





















; + - 


f 






^ 






++++ +^ 





? 400 r 

1.350 r 

g 300 r 
> 

250 r 
200 r 
150 r 
100 r 
50 r 
Or- 



1000 1500 2000 

Estimated energy (GeV) 



600 800 1000 

Estimated energy (GeV) 



I Crab excess events, SIZE-2 = 20-40 phe~ 



600 7 

: 

I 500 7 
'4007 
300 7 
200 7 
100 7 
— 



Mean 81.55 
RMS 39.16 



50 100 150 200 250 300 350 400 
Estimated energy (GeV) 



Fig. 5. Distribution of estimated energy for the Crab Nebula excess events (analysis 
3) in the three ranges of Size-2 considered: above 100 phe, from 40 to 100 phe and 
from 20 to 40 phe. 



^ Considering two different image cleanings and applying the same Size-2 cut, the 
two data samples obtained will contain essentially the same events, differing only 
in the events that survive just one of the cleanings. 
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3.2 Background rejection 



For the two higher Size-2 bins considered, a series of three Alpha plots are 
shown in figures and [71 The first Alpha plot is relative to the standard 
analysis (1), the second to the time cleaning analysis (2) and the third to the 
time cleaning and time parameters (3). In the case of analysis 1, the Alpha and 
Hadronness cuts are optimized to obtain the best statistical significance of the 
excess. For the analyses 2 and 3, the Hadronness cut was chosen so that we 
got the same number of excess events as in analysis 1 (after applying the same 
Alpha cut). In this way we can easily compare the background suppression 
provided by each analysis procedure. Note that the histograms with error 
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Fig. 6. Crab Nebula Alpha plots (excess and residual background) obtained with 
the three tested analysis methods. The Size-2 parameter is above 100 phe, corre- 
sponding to an energy distribution peak of ~ 280 GeV. Fixed the optimal cut for 
analysis 1, the other cuts are chosen in order to have roughly the same number of 
excess events in all three analyses. 



Analysis 


HADR. cut Alpha cut 


Excess 


Background 


O'LikMa/Vh 




(deg) 


(7/min) 


(events / min) 




1 


0.09 8 


3.78±0.13 


1.08±0.06 


12.5 


2 


0.10 8 


3.75±0.13 


1.14±0.06 


12.3 


3 


0.07 8 


3.78±0.12 


0.57±0.04 


14.0 



Table 1 

Statistics of the plots in figure^ {Size-2 > 100 phe; Ep^ak — 280 GeV), obtained 
with 5.7 h of observation. 
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Fig. 7. Crab Nebula Alpha plots (excess and residual background) from analysis 
1, 2 and 3, in the Size-2 range from 40 to 100 phe, corresponding to an energy 
distribution peak of ~ 150 GeV. 



Analysis 


HADR. cut 


Alpha cut 


Excess 


Background 


O'LiSzMa/Vh 






(deg) 


(7/min) 


(events / min) 




1 


0.10 


12 


3.00±0.23 


7.58±0.15 


5.5 


2 


0.09 


12 


3.01±0.21 


5.62±0.13 


6.2 


3 


0.07 


12 


3.12±0.17 


3.29±0.10 


7.8 



Table 2 

Statistics of figure[7](40 phe < Size-2 < 100 phe; Epeak — 150 GeV), obtained with 
5.7 h of observation. 



bars represent the Alpha distribution of the excess events, instead of the usual 
plot showing the on-source data before the background subtraction. In this 
way we can immediately see that the gamma excess is similar in all three 
analyses, regardless of the background level. 

The main result from this comparison is that the use of the time cleaning and 
the time parameters allows to halve the residual background while keeping 
the same number of excess events, with respect to the analysis using no time 
information. This can clearly be seen in figures [6] and [7] and the corresponding 
tables. Tab. [1] and [2l Note that the quoted significance values are calculated 
using only one false-source position for the background estimation, so the ratio 
of on-source to Oj^-source exposure is one. The results for the lowest Size-2 
bin will be discussed in section 13. 4[ 



15 



3.3 Flux sensitivity to point sources 



From the results of the Crab Nebula observations we can estimate the flux 
sensitivity to point sources achievable with the different analyses. We define 
the flux sensitivity as the minimum gamma ray flux detectable in 50 hours, 
where "detectable" means that the excess of gamma rays corresponds to a 
signal to noise ratio of five {Nexc/ \J^bg = 5). This is the standard definition 
commonly used in the field, but note that it does not correspond exactly 
to a "5(7 detection", because the real significance is usually computed with 
the Li and Ma formula [22] which takes into account the uncertainty in the 
determination of the background. 
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Fig. 8. Sensitivity (for 50 h) curves as function of the rate of gamma rays after 
cuts. The black triangles, blue squares and red circles correspond to analysis 1, 
2, and 3, respectively. Upper panel: Size-2 > 100 phe, corresponding to an energy 
distribution peak of ~ 280 GeV. Lower panek 40 < Size-2 < 100 phe, corresponding 
to an energy distribution peak of ~ 150 GeV. The curves are obtained by a scan of 
the cut in the Hadronness parameter. 
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The flux sensitivity depends on the strength of the background discrimination 
cut {Hadronness < Hmax)- Actually, the cuts which maximize the statistical 
significance of the excess from a strong source like the Crab Nebula, as used 
in the previous section, are not the ones resulting in the best flux sensitivity: 
weak sources require tighter cuts. In this section we present the results of a 
scan of the Hadronness cut values, shown in figure [8j the flux sensitivity (in 
percentage of the flux of the Crab Nebula) is plotted as a function of the rate 
of excess events. Each Hadronness cut of the scan leads to a different rate 
of excess and background events and thus to a different flux sensitivity. The 
figures correspond to the two Size-2 bins considered in the previous section. 
The black triangles represent the standard analysis 1, whereas the blue squares 
and red circles refer to the analysis 2 (with the time cleaning) and 3 (time 
cleaning and time-related parameters). Note that, since the values are derived 
from real Crab Nebula observations, the flux percentage is relative to the true 
Crab flux, and not to the simple power-law spectrum that is often assumed 
in sensitivity estimates based on MC. For each choice of Hadronness, a fixed 
Alpha cut (of 7° and 10° respectively) was applied in order to compute the 
sensitivity. The improvement coming from the use of timing in the analysis 
is clear in both cases. It must be noted that in the higher energy bin, all of 
the improvement comes from the use of the timing parameters, whereas in 
the lower one the introduction of the time cleaning already results in some 
improvement in sensitivity. The best integral sensitivity that can be reached 
is around 1.6% of the Crab flux for a peak energy of 280 GeV (left panel of 
figure [HD . 

We have computed also the flux sensitivities in differential bins of estimated 
energy for analysis 3, shown in table [31 



Eest range 


H. cut 


a cut 


Excess 


Backg. 


Sensitivity 


(GeV) 




(deg) 


(7/min) 


(events / min) 


(% Crab) 


100 <E < 200 


0.02 


12 


0.70±0.06 


0.23±0.03 


6.3 


200 <E < 300 


0.02 


8 


0.80±0.06 


0.11±0.02 


3.7 


300 < < 500 


0.02 


8 


l.OOibO.06 


0.09±0.02 


2.8 


500 < ^ < 1000 


0.04 


4 


0.79±0.05 


0.03±0.01 


2.1 


E > 1000 


0.06 


4 


0.28±0.03 


0.005±0.003 


2.1 



Table 3 



Sensitivity (% Crab in 50 h) and statistics for some differential energy bins using 
the time cleaning and the timing parameters in the analysis 3 of section 13. 1[ Cuts 
are optimized separately in each bin with the best sensitivity criteria. Observation 
time: 5.7 h. 
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3-4 Use of timing at lower energies 



The background suppression capabilities degrade as we move towards lower 
energies. This trend can be clearly seen by comparing figures and [7] and 
their corresponding tables: if we focus on analysis 3, we notice that we move 
from having a signal nearly seven times larger than the residual background, 
to having a signal (integrated below the Alpha cut) slightly smaller than the 
background. This is mainly a result of the worsening of the gamma/hadron 
discrimination and of the fact that the spectrum of the Crab Nebula is harder 
than that of the background, although this latter contribution is smaller. 

In figure M "we show the results for analyses 2 and 3 in the Size-2 range from 
20 to 40 phe, where most of the excess comes from sub- 100 GeV gamma rays 
(see third pad of figure [5]). Given the modest signal (a mere 5.7 cr significance 
in analysis 3), we have in this case adjusted the cuts to obtain the same 
background rate (~ 80 events/min) in both analyses, and then compared the 
gamma ray excesses. Once more, the improvement in performance due to the 
introduction of the timing is clear, though less significant due to the large 
residual background. With roughly the same background rate the excess rate 
for analysis 2 is 2.5±0.7 7/min whereas for analysis 3 it is 4.0±0.7 7/min. In 
this energy range, analysis 1 even fails to produce a significant signal, due to 
the high cleaning levels. 



ALPHA plot (analysis 2n 



ALPHA plot (analysis 3n 
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Fig. 9. Crab Nebula Alpha plots (excess and residual background) from analysis 
2 and 3 in the bin of Size-2 from 20 to 40 phe (estimated energy of the gammas 
mostly below 100 GeV). Note that in this case we have adjusted the background 
rate instead of the excess rate to be the same in both cases. The Alpha cut applied 
is of 12 deg. 



For these low energies, the background overwhelms the signal even of a strong 
source like Crab, and is actually setting a further limitation for the observation 
of weak sources. The signal must not only be statistically significant, but also 
well above the systematic uncertainty in the determination of the background, 
which is at least of a few percent and unlike statistical significance, it does 
not get better with longer observation times. 

It has to be noted that, even after strong background rejection cuts, a certain 
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amount of "irreducible" background survives. The background rejection power 
of the imaging technique degrades fast with the lowering of the energy of the 
primary gamma-ray. The reason of this degradation can be attributed both 
to the physics of the air showers and to the technical limitations of the lACT 
instruments. The irreducible background is made up by shower images which 
are similar to gamma induced images in all of the image parameters used for 
the event discrimination. This may reflect the shortcomings of the instrument 
in recording the small and faint images of low energy showers, resulting from 
its limited light collection efficiency and camera pixelization. A larger reflect- 
ing surface or an increased quantum efficiency camera, together with a finer 
pixelization, would obviously improve the accuracy of the reconstructed image 
parameters. On the other hand, even assuming a perfect lACT detector, there 
is no guarantee that the intrinsic characteristics of the cascades are different 
enough to permit to distinguish the nature of the primary particles. The study 
of the characteristics of the gamma-like background (see for example [23] or 
|24] ) heavily rely on the MC simulation packages. The nature of this irreducible 
background is attributed to n^-s, primary electrons and long flying relativistic 
particles (like fi-s). Proton-induced air showers typically produce pions in the 
flrst interaction stage. The charged pions decay into muons, whereas the vr^-s 
(most often decaying into two gammas) originate electromagnetic sub-showers. 
If the energy of the primary particle is above a few hundred GeV, the superpo- 
sition of the sub-showers from different vr^-s with diverging trajectories make 
hadronic showers to appear wider and more "patchy" than gamma-initiated 
ones. As is well known, this is the feature of hadron-initiated cascades which 
allows best to suppress them in the analysis of lACT data. However, as the 
energy of the hadronic primary goes down, the pion multiplicity drops, and 
the chances that the light of a single electromagnetic subshower dominates 
the image recorded by an lACT get larger. This means that, regardless of the 
characteristics of the telescope, the amount of irreducible hadronic background 
will necessarily increase as we go down in energy. 

Also the background images due to distant muons can easily get confused, by 
a single-dish lACT, with low energy gammas. Those images have generally a 
small size which make it difficult to recover information from the shower shape. 
The time spread of pixel signals has been proposed in [6] as a parameter which 
may be used to suppress such background, but as we will discuss in section HI 
the method does not seem to work efficiently for MAGIC. The improvement 
in sensitivity in the < 100 GeV energy range is due to the contribution of the 
Time Gradient parameter. 

In summary, even though the overall background discrimination worsens very 
fast with decreasing energy, the use of the timing parameters (mainly the Time 
Gradient) has been shown to improve background suppression efficiently in the 
whole energy range of MAGIC, even below 100 GeV. 
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3.5 Application of timing analysis to older MAGIC data 

In the previous sections we have discussed exclusively MAGIC data taken 
after the installation of the fast readout in February 2007. In earlier MAGIC 
data, the signal sampling was made with FADCs with 300 MSample/s. Before 
digitization, the pulses were stretched up to 6 ns FWHM (10 ns for the low 
gain), to ensure proper sampling. Timing information was only used in the 
image cleaning stage of the analysis, to reduce the energy threshold [T^ . Only 
shape parameters were used for background suppression. With that sort of 
"classical" analysis, no improvement in performance could be seen after the 
upgrade of the DAQ system: both for the data before and after the upgrade the 
best integral flux sensitivity achieved in wobble mode was around 2.4% of the 
Crab Nebula flux in 50 hours. The reduction of the signal integration time (and 
subsequent reduction of the integrated NSB noise) does not seem to result, by 
itself, in an improved performance. The reason is that the intrinsic fluctuations 
of the Cherenkov light recorded by a pixel (coming from the Poissonian photon 
statistics) dominate over the fluctuations of the NSB light (with a mean rate of 
about 0.13 phe per nanosecond in an inner pixel), and therefore the reduction 
of the NSB noise does not change signiflcantly the precision of the charge 
measurement in a pixe0. 

Despite the poorer quality of the timing information recorded in data taken 
before February 2007, we have also tried to apply the timing parameters de- 
flned in section [231 to their analysis. A 8.7 hour Crab data sample taken in 
good weather conditions during four nights in December 2006 and January 
2007, shortly before the change of the DAQ system, was used for this test. It 
turns out that the improvement in background suppression brought about by 
the timing parameters on these data is smaller than the one obtained on newer 
data, shown in sections 13.21 and 13.31 The best integral sensitivity, achieved for 
the SIZE-2> 100 phe sample, is 1.9% of the Crab flux. 

Although we have not yet been able to investigate this issue in detail, the most 
obvious reason for the smaller effect of the timing parameters in the analysis 
of MAGIC data taken before the upgrade of the readout is the worse quality 
of the timing information. Nevertheless, the precision of the determination of 
the arrival time of calibration pulses, estimated by studying the reconstructed 
times for different pixels as discussed in section 12.41 is roughly the same for 
both setups (~ 390 ps RMS). Calibration flashes make rather large pulses 
of around 35 photoelectrons per inner pixel [9], and certainly most of the 
pixels in the processed shower images have smaller signals. On top of that. 



° This applies to the "dark night" observations discussed in this paper. For observa- 
tions under moonlight or in twilight, with a higher rate of NSB photons, the shorter 
integration time is indeed an advantage in the reconstruction of small showers. 
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calibration pulses are wider than pulses from showers (4.6 vs. 2.3 ns FWHM). 
That is, calibration pulses are not representative of the bulk of the pulses 
which contribute to the images and thus to the time parameters we use in 
the background discrimination. In conclusion, the study mentioned above, 
using calibration pulses, is not in contradiction with the naive expectation 
of an improvement in the accuracy with which pulse times are reconstructed 
with the faster sampling. The better performance of timing parameters in the 
suppression of the background after the FADC upgrade is an indirect evidence 
that this is the case. 



3.6 Energy estimation 

The Random Forest method can also be used for the estimation of a continuous 
variable. It is the standard method used in the analysis of MAGIC data for 
estimating the energy of the showers (under the assumption that they are 
gamma rays). In an analogous way as for the background rejection, the RF is 
trained with MC gammas, whose true energy is known. The main difference is 
in the way the RF is built: the optimal cut in each node of the trees is chosen 
to minimize the variance of the true energies of the event samples resulting 
from the split [21], rather than their purity. No background event sample is 
needed for this kind of training. 

The set of image parameters typically used for the estimation of the energy 
is: Size, Width, Length, Dist, Cone, Leakage and Zenith Angle. In order to 
evaluate the improvement in the energy reconstruction due to the use of the 
signal timing, the energy of a test sample of Monte Carlo 7 events (different 
from the sample used for the training) was evaluated and compared to the 
true known energy of the primary gamma rays. An improvement in the energy 
reconstruction is expected if the image parameter Time Gradient is added to 
the default set of parameters since it provides information about the impact 
parameter of the shower. This should help to avoid the degeneracy between 
small, nearby showers and the large, distant ones. As introduced in section 
12. 3[ both Dist and Time Gradient parameters are well correlated with the IP 
and can be used for its estimation. Notice that the correlation of Dist is rather 
good for small IP event whereas for larger IP the estimation with Dist becomes 
poorer (figure [3]). In case of Time Gradient the correlation is better for higher 
IP values, very likely because for distant showers the time structure of the 
images is more pronounced and as a consequence more precisely measured. 

The distributions of the quantity (i?r-ec — Etme) / Erec for different energy bins 
were used to estimate the quality of the energy reconstruction. In fig. [10] the 
black triangles refer to the energy estimation with the standard parameter 
set, that is, data processed with the 10-5 phe image cleaning without time 
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Fig. 10. Graph of the value of the RMS with respect to zero of the quantity 
{Eest — Etrue) / Eest for different energy bins. This value is an overall estimator of 
the energy resolution. The black triangle and the blue square show data processed 
with the standard set of energy estimation parameters (image cleaning 10-5 phe and 
time image cleaning 6-3 phe respectively). The red circles correspond to data pro- 
cessed with time image cleaning and also Time Gradient used as energy estimation 
parameter (time image cleaning 6-3 phe). 



constraints. The blue squares correspond to an energy reconstruction per- 
formed with the standard parameter set and the time image cleaning 6-3 phe 
(see section [2TT1) . while the red circles are obtained from the time-constrained 
image cleaning and the Time Gradient image parameter being added to the 
standard set for the energy estimation. The graph represents the value of the 
RMS of {Eest — Etrue) I Eest with rcspcct to zero instead of the mean value. 
This quantity is preferred to the simple RMS as an overall estimator of the 
quality of the energy reconstruction, since it takes into account not only the 
spread of the distribution, but also a possible bias with respect to zero (see 
[T7] for more details). "Leakage" effects for images located close to the edge of 
the camera could be important when the energy reconstruction is performed 
since the number of photons in the part of the image outside the camera is 
actually not measured. A standard selection cut Leakage < 10% is applied 
in this analysis and tests with tighter Leakage cuts revealed no significant 
changes with respect to the shown results. The use of the Time Gradient im- 
age parameter in the energy estimation yields to a relative improvement in 
energy reconstruction of around 15%. 



22 



4 Discussion 



We have established that a significant improvement of performance of the 
MAGIC Cherenkov telescope, both in terms of flux sensitivity and energy res- 
olution, can be achieved by using the timing of signals in the reconstruction 
of shower images. The reason for the improvement is two-fold: on one hand, 
the time-constrained image cleaning allows to reduce the cleaning charge lev- 
els without adding noise coming mainly from the night sky background light, 
which results in a lower analysis energy threshold. On the other hand, the 
timing profile of the images, represented by the Time Gradient, provides in- 
formation about the shower impact parameter, a relevant quantity which is 
otherwise poorly determined by a single-dish lACT. 

The results presented here have been reproduced also by a different analysis 
within the MAGIC collaboration. This analysis shares the "core" of MARS 
and is based on [25] but the algorithms for the analysis are developed inde- 
pendently, for example fixed cuts in combinations of the image parameters 
are used instead of the Random Forest. Also in this case is the introduction 
of the Time Gradient led to the described improvement in the background 
suppression. 

From the sensitivity graphs of section 13.21 it is possible to conclude that the 
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Fig. 11. Time RMS versus Size of the event. The green color refers to simulated 
gammas, red to simulated muons and blue to clear real muon events (an image 
arc of at least 180° is required). Gammas and muons extensively overlapped in the 
considered parameters space. Notice the absence of events below 500 ps, evidence 
of our blindness to time features below this threshold, and the increase of the time 
RMS values at very low sizes likely due to the worsening of the time resolution for 
very small signals. 
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time cleaning alone results in a significant sensitivity improvement in the low 
energy regime (40 < Size-2 < 100 phe), coming from the increased event 
statistics. At higher energies it does neither improve nor worsen the telescope 
performance significantly although the lower cleaning results in more pixels per 
image. In contrast, the Time Gradient seems to be helpful in the entire energy 
range accessible to MAGIC. This parameter allows to reject hadron showers 
whose images are gamma-like in shape and oriented towards the gamma-ray 
source location on the camera, but whose Dist and Time Gradient parameters 
are not consistent with what is expected for a gamma shower coming from a 
point source. 

The events with very large Time RMS (> 1.5 ns) are rejected thanks to 
the Time RMS parameter. The background rejection power shown by the 
Time RMS in this study is much lower than foreseen in [B]. This is most 
likely due to the too optimistic assumptions regarding the telescope features 
made by the authors of that work with respect to the actual characteristics 
of MAGIC, in particular regarding the reflecting dish. In the final mounting 
the panels of the MAGIC mirror are staggered in a chessboard pattern to 
facilitate their movement and to ensure a proper focusing, and this causes 
the parabolic dish not to be perfectly synchronous. The mirrors staggering 
together with the other sources of time spread in the acquisition chain, like 
the jitter in the transit time of the electrons in the PMTs, lead to a time- RMS 
response larger than expected from the authors of [6j. The value that results 
from almost synchronous input signals, for example muon events, is ~0.7 ns, 
a value comparable with the intrinsic time spread of the low-energy 7-events 
(see figure [TTl) . Therefore, the tagging of single distant muons from just their 
time spread is at the moment not possible. 

In conclusion the use of timing information in the analysis of MAGIC data 
provides a considerably better background suppression and results in an en- 
hancement of about a factor 1.4 of the flux sensitivity to point-like sources, 
as tested on real observations of the Crab Nebula. This gain is equivalent to 
doubling the available observation time. 

Improvements of the order of 15% have been found in the event energy recon- 
struction. In fact the time gradient gives information about the real impact 
parameter of the shower and therefore it helps to distinguish distant high en- 
ergy showers from closer, low energy ones. 

We expect that this type of timing analysis may also be helpful to future 
Cherenkov telescopes. Even if the Time Gradient is very likely not useful for 
stereo lACT systems, this does not exclude that different time-related image 
parameters can be worth for the reduction of the stereo system data. The time 
image cleaning algorithm would be instead worth for either stereo and single 
lACT systems. 
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